AITopics | hyper parameter

Collaborating Authors

hyper parameter

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

580c4ec4738ff61d5862a122cdf139b6-Supplemental-Conference.pdf

Neural Information Processing SystemsFeb-9-2026, 03:45:01 GMT

dirichlet, rlw, section 4, (16 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

309fee4e541e51de2e41f21bebb342aa-Supplemental.pdf

Neural Information Processing SystemsFeb-7-2026, 23:54:53 GMT

coefficient, commitment loss coefficient, loss coefficient, (15 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.67)

Add feedback

A Additional HQA Results Table 5: Additional CelebA interpolations of the HQA encoder output z

Neural Information Processing SystemsOct-2-2025, 14:27:59 GMT

Compression is from 98,304 to 576 bits (171x compression). Compression is from 98,304 to 144 bits (683x compression). The far left and right images are originals. B.1 Motivation In this section we outline the probabilistic model that motivates the HQA loss: L = log p (x | z = k) H [ q ( z |x)] + E A desired property of the HQA, motivated in Section 4.4, is the non-deterministic posterior We contrast these two models in Figure 8. This model is a V ariational Autoencoder with a simple Mixture of Gaussians prior.

artificial intelligence, commitment loss coefficient, machine learning, (12 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.67)

Add feedback

DominoSearch: Find layer-wise fine-grained N: M sparse schemes from dense neural networks - Supplementary Material

Neural Information Processing SystemsAug-16-2025, 18:40:25 GMT

Section 2: Experimental study of a different policy with fixed N and flexible M. Section 3: Sensitivity of hyper-parameter β In the main paper, we assume a policy with fixed M and flexible N. Furthermore, we also use a design space with N equal to a power-of-two. This is achieved by transforming the schemes of fixed M. For instance, 8:16, 4:16, 2:16 and 1:16 will be transformed as 1:2, 1:4, 1:8 and 1:16 with fixed N (1) and flexible M (2,4,8,16). Results are shown in Table 3. Figure 1 and 2 illustrate the differences between 1:2 and 2:4 with the same dense weight matrix and sparsity (i.e. Details can be found in Section 3.4 of the main paper. It consists of more than 1.2 million training images and Each image is labelled as one of 1K classes.

artificial intelligence, dominosearch, machine learning, (16 more...)

Neural Information Processing Systems

Genre: Research Report > New Finding (0.35)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.67)

Add feedback

95323660ed2124450caaac2c46b5ed90-Supplemental.pdf

Neural Information Processing SystemsAug-16-2025, 03:22:37 GMT

artificial intelligence, machine learning, slac, (14 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.37)

Add feedback

580c4ec4738ff61d5862a122cdf139b6-Supplemental-Conference.pdf

Neural Information Processing SystemsAug-15-2025, 00:09:51 GMT

dirichlet, rlw, section 4, (16 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Advanced Deep Learning Techniques for Analyzing Earnings Call Transcripts: Methodologies and Applications

Zakir, Umair, Daykin, Evan, Diagne, Amssatou, Faile, Jacob

arXiv.org Artificial IntelligenceFeb-26-2025

This study presents a comparative analysis of deep learning methodologies such as BERT, FinBERT and ULMFiT for sentiment analysis of earnings call transcripts. The objective is to investigate how Natural Language Processing (NLP) can be leveraged to extract sentiment from large-scale financial transcripts, thereby aiding in more informed investment decisions and risk management strategies. We examine the strengths and limitations of each model in the context of financial sentiment analysis, focusing on data preprocessing requirements, computational efficiency, and model optimization. Through rigorous experimentation, we evaluate their performance using key metrics, including accuracy, precision, recall, and F1-score. Furthermore, we discuss potential enhancements to improve the effectiveness of these models in financial text analysis, providing insights into their applicability for real-world financial decision-making.

sentiment analysis, transcript, transcript data, (15 more...)

arXiv.org Artificial Intelligence

2503.01886

Country: Asia > Middle East > Republic of Türkiye (0.04)

Genre:

Research Report (1.00)
Financial News (0.72)

Industry: Banking & Finance > Trading (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Review for NeurIPS paper: Contrastive learning of global and local features for medical image segmentation with limited annotations

Neural Information Processing SystemsJan-26-2025, 15:43:49 GMT

Weaknesses: There are a number of small weakness to the approach, the technique to some degree depends on well registered images and there are a number of extra hyper parameters introduced, such as the number of partitions to use per 3D volume, the number of pre-trained decoder blocks to use and the region size. I would expect these aspects to be largely problem dependent, and the degree to which results would also be improved on other problems is therefore somewhat unclear. I do not think this invalidates the above comment. Aside from the approach itself, I would also have liked some information on training time and convergence. How easy is this to setup, train and add to existing training processes?

joint training, local feature, medical image segmentation, (7 more...)

Neural Information Processing Systems

Industry: Health & Medicine > Diagnostic Medicine > Imaging (0.40)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Reviews: Approximate Inference Turns Deep Networks into Gaussian Processes

Neural Information Processing SystemsJan-26-2025, 14:03:26 GMT

There's some space to improve for the experiments. I think the main contribution of this paper is proposing a method to transform the complicated neural network structure to a nonlinear feature mapping function, so that they can linearly separate the weight and feature mapping. Given the feature mapping, kernels/correlations and posterior distributions over output functions can be explicitly built for BNN (or DNN). Therefore, I would expect to see 1. What does this feature mapping look like? I think the authors show the kernel instead of the mapping itself.

approximate inference turn deep network, feature map, kernel, (10 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.40)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.37)

Add feedback

Reviews: Deep Generalized Method of Moments for Instrumental Variable Analysis

Neural Information Processing SystemsJan-21-2025, 22:12:28 GMT

Originality: This work builds on recent work on adapting deep networks for use with instrumental variables (DeepIV [Hartford et al 2017] & Adversarial GMM (AGMM) [Lewis & Syrgkanis 2018]) but adapts the optimally weighted GMM [Hansen 1982] (OWGMM) for the task. AGMM is probably most similar in that it is also an adversarial loss, but the variational reformulation presented in this paper results in a far simpler algorithm. Quality: I thought this was great paper. The variational reformulation of OWGMM leads to a far simpler objective function that neatly leverages the explosion of recent work in adversarial learning (GANs, etc.) by replacing a large number of moment conditions with a single adversarial network. That said, given that the method appears useful in practice, I would have liked to see more detailed experiments on the practical considerations.

deep generalized method, instrumental variable analysis, poly2sls, (11 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.77)

Add feedback